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Abstract 



We count the number and patterns of pairs and tuples of independent events 
' in a simple random experiment: first a fair coin is flipped and then a fair die 

^ ' is tossed. The first number, equal to 888,888, suggest that there are some open 

I questions about the structure of independence even in a finite sample space. We 



discuss briefly these questions and possible approaches to answer them. 



^ ' Introduction. In pLj A. Kolmogorov wrote: "The concept of mutual independence 

H ■ 

of two or more experiments liolds, in a certain sense, a central position in the theory 
of probability." However, although the definition of independence is straightforward, it 
seems that even for a very simple sample space it is not quite clear what the "sources" 
of independence are and how to describe them. 

Let us consider the following simple random experiment: first we flip a coin and then 
we toss a die. Our sample space consists of 12 outcomes each having a probability of 
1/12. This experiment is used in many textbook as an illustration of the concept of 
independent events. However, as far as we know, no one has asked the following two 
simple questions. 
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Question 1. How many different pairs {A,B) of independent events are tliere ? 
Question 2. How many different tupies of independent events (^1,^42, ■■■,Ak) are 
tfiere ? 

Let tfie numbers Ki and K2 be tfie answers to tfiese questions. Tliey can be calculated 
exactly, and are a bit puzzling. The first number Ki is equal to a rather strange looking 
number 888, 888, which may seem too large. On the other hand, the number Ki may 
also seem too small. It is only about 10% of the number of all possible unordered pairs 
{A, B) of subsets of this sample space, which is | * 2^^ ^ (^2^2 ^ ^ 399^ 656. The 
second number, K2 — 30, 826, 488. As we will see, most of these pairs and tuples are 
isomorphic and can be obtained in a small number of different ways or patterns. This 
raises one more question: are the answers to Question 1 and Question 2 counterintuitive 
or not ? and what do they suggest ? It seem they suggest two open problems. 

Problem 1. How many different patterns of independence are there for any (finite) 
sample space and how to describe them ? 

In fact, when we think about independent events, we have in mind two different 
concepts covered by the same definition. The first is: independent events are those 
which are produced by two or more independent sources of randomness, i.e. independent 
"random generators." The second is: information that B has occurred does not changed 
the probability of A. So it seems that there is another open question. 

Problem 2. How to distinguish the two concepts mentioned above? 

In the next section we calculate the numbers Ki and K2-, and in the last section we 
briefly discuss Problems 1 and 2. We are far from giving conclusive answers to these 
questions and hope that this small note will encourage other probabilists to express their 
opinions. We also note that there is a huge literature treating these questions but we 
did not find direct relations to our example and even a brief survey will substantially 
increase the size of this note. 

The author would like to thank Harold Reiter, Ernst Presman and Michael Grabchak 
who read the first version of this paper and made valuable comments. 

Answers to Questions 1 and 2. Let us denote 1^41 = a, \B\ — 6, and \AB\ — d, 
where AB = Af] B. Since the order in the pair (A, B) does not matter, without loss of 
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generality we can assume that a < b. Formally, two events are always independent if one 
of them is the whole sample space or an empty set but this is not "real independence" 
so we will assume that < a,b < 12. Then it is easy to see that 1 < d < a,b. While 
potentially d — 1,2, 10, we will see later that d < 6, (see also Proposition 1 in section 
3). 

lfd=l, then ^ = i.e. a6 = 12 = 3*4 = 2*6. If a = 3, 6 = 4 then there are (\^) 
choices for AB, (^^^ for the two remaining elements of A, and choices for the three 
remaining elements of B. Similar reasoning works for the case when a — 2, b — 6. Hence 
the number of such pairs is (\^) (") (3) = 55,440 = rii, and (\^) (") (5") = 12*11*252 = 
33, 264 = 712. 

If two events A and B arc independent then the pairs A and B'^ , A'" and B, and 
A^ and B'-' are also independent. This means that if a = 3, 6 = 4 then we will have 
three more cases. Slightly abusing notation, and keeping the assumption that a < 6, we 
can describe them as: 2) a = 3, 6 = 8, d = 2; 3) a = 4, 6 = 9, d = 3; and 4) a — S,b — 9, 
d = 6. In all four cases the number of pairs is the same, rii = 55, 440. We can describe 
all of them by an (ordered) partition A^i = [1,2,3,6]. The multinomial coefficient Ui is 
the total number of such partitions. Each element of any partition specifies a unique 
pair of independent events. 

For the case a = 2, 6 = 6, d = 1 we have only one more case a — 6, b — 10, d — 5, 
because in this case \B\ = \B^\ = 6. We can describe these two cases by a partition 
N2 = [1, 1, 5, 5]. The number of pairs in each case is the multinomial coefficient 

If ci = 2, then ^ = i.e. a6 = 24 = 3 * 8 = 4 * 6. We do not consider other 

factorizations since they will not satisfy other restrictions, e.g. 24 = 2 * 12 since d — 
2 < a,b. Hence the number of such pairs is rii in the first case and ns = (^^^^ (^2°) (4) = 
66 * 45 * 70 = 207, 900 in the second case. The number 713 corresponds to a partition 
A^3 = [2, 2, 4, 4], which produces one more case a = 6, 6 = 8, d = 4. 

If 0? = 3, then ^ = i-e. a6 = 36 = 4 * 9 = 6 * 6. Again, we must skip some 
other factorizations. The factorization 36 = 4 * 9 corresponds to a partition A^i. The 
factorization 36 = 6 * 6 corresponds to a partition N4 — [3, 3, 3, 3], which produces the 
number of pairs = ^Q) Q = ^(220 * 84 * 20) = 184, 800. Note that the factor | 



is present because in a pair of sets {A, B) both have the same size 6. 

If 0? = 4, then a6 = 48 = 6 * 8. The other factorizations are impossible. This is a case 
from partition A^3. Hence the number of such pairs is n^. 

If c? = 5, then similarly a6 = 5*12 = 60 = 6*10. This is a case from partition N2- 
Hence the number of such pairs is n2. 

If d = 6, then similarly a6 = 6*12 = 72 = 8*9. This is a case from partition A^i. 
Hence the number of such pairs is rii. 

If d > 7, then ah — V2d and it is easy to see that there no possible factorizations 



since d < a, 6 < 12. 

In summary we have the following table of all possible pairs 



d] ah 


1 ; 12 


2 ; 24 


3 ; 36 


4 ; 48 


5 ; 60 


6 ; 72 


d*h — 


3*4,2*6 


3*8,4*6 


4 * 9,6 * 6 


6*8 


6* 10 


8*9 


Partition 


A^i ; N2 


A^i ; A^3 


A^i ; A^4 


A^3 


N2 





And the total is number Ki = 4ni + 2^2 + 2^3 + ^4 = 888, 888. 

Answer to Question 2. If jASCl — e — 1, and \A\ — a, \B\ — h, \C\ — c then 
^ = a6c = 144 = 3 * 3 * 2^ 6 * 6 * 4. Since pairs {A,B), {A, C) and {B, C) must 

also be independent, we consider only the decompositions of ahc into factors such that 
corresponding pairs are present in our Table. Thus, though e.g. 144 = 3*6*8 = 9*4*4, 
since there is no pairs 3*6, or 4 * 4 in our table, we do not analyze them. A triplet 
A,B,C with e — 1, and a = 6 = 6,c = 4, implies automatically through ^ * ^ — ^ 
and other similar equalities that \AB\ = 3, \AC\ = \BC\ = 2, and a partition A^5 = 
[1,1,1,1,2,2,2,2]. Then there are (\^) choices for ABC, (^^^ for the five remaining 
elements of A, choices for the two elements of B which are in ABC^, choices 
for the three elements of B outside of A, choices for the element of C in AB'^C, 

choices for the element of C in A'^BC, and choices for the element of C outside 
of AU B. We must also divide the product by two because sets A and B both have 
the same size 6 and their order does not matter. The total number of such triplets is 
ris = (1^1) g) g) g) * i = 12 * 462 * 10 * 20 * 3 * 3 * 3 * i = 14, 968, 800. 
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lie = 2, then abc = 2*144 = 3*3*2^ = 6*6*8 = 8*9*2 = ... Products 6*6 and 6*8 
are present in our table but 2*9 and other are not so we need only to analyze any triplet 
with a = 6 = 6, c = 8, and hence automatically with \AB\ — 3, \AC\ — \BC\ — 4. This 
is the other case for the partition above, and hence the total number of such triplets 
is again = (4°) {fj (3) (^) (^) g) * i = 66 * 210 * 4 * 20 * 3 * 3 * 3 * | = 14, 968, 600. 

If e = 3, then a6c = 3* 144 = 3*3*3* 2^ = 6*6* 12 = 8*9*6 = .... For 
any factorization there are pairwise products which are not in our table. If e > 3 then 
abc = 6*3*3*4*4 and then at least one of a, 6, c must be at least 12. Similar reasoning 

shows that there is no four or more independent events. 

Thus the total number of independent tuples, including pairs, is K2 = Ki + 2n^ = 
30,826,488. 

Note that if \S\ — 11, or 13, and p{s) — 1/\S\ then for such sample spaces there are 
no independent events at all. The following remark is due to E. Presman: 

Proposition 1. If a sample space S consists of n equally likely outcomes and there 
are k independent events with r common points then 

< ^ < {[n/2]/nf. (1) 

The proof follows from the equality P{Ai...Ak) — P{Ai)...P{Ak), where the left side 
is equal to ^, and in the right side each set Ai can be replaced by its complement if 

necessary. 

Discussion. We have found four different patterns of independence in our example. 
We do not have a definitive answer to Problem 1 in the general case. A possible approach 
to Problem 2 is as follows. If we shghtly change the probabihties of the sample points, 
the independence will disappear for almost all pairs, and it is easy to show that there 
are arbitrarily small perturbations which result in the complete absence of independent 
events. On the other hand, if we change arbitrarily the probabilities of the independent 
"generators", i.e.if, in our case, we have a biased coin and a biased die then about one 
hundred events will stay independent anyway, i.e.they are "truly independent" . A well- 
known example due to S. Bernstein gives three dependent events, such that any pair of 
these events is independent; such independence is unstable ! The same is true of the 
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following simpler example. There are three coins, two of them are fair and are flipped 
independently, the third coins is placed H(ead) up, if the first coin and the second show 
H, T or T, H, and placed T(ail) otherwise. The three coins are not independent but 
each pair of coins is. 

Similar questions about the structure of random generators and independent events 
make sense if we consider conditionally independent events. They correspond to random 
generators triggered by the outcomes of other random generators. 

To conclude this note we give another citation from a subsection on independence 
in [1] : "In consequence, one of the most important problems in the philosophy of the 
natural sciences is - in addition to the well-known one regarding the essence of the 
concept of probability itself - to make precise the premises which would make it possible 
to regard any given real events as independent. This question, however, is beyond the 
scope of this book." 

References 

[1] Kolmogorov A.N., 1956, (1933). Foundations of the Theory of Probability. NY, 
Chelsea Publ. Company. 



6 



